NSF PAR Search | NSF Public Access Repository

S³Attention: Improving Long Sequence Attention with Smoothed Skeleton Sketching

https://doi.org/10.1109/JSTSP.2024.3446173

Wang, Xue; Zhou, Tian; Zhu, Jianqing; Liu, Jialin; Yuan, Kun; Yao, Tao; Yin, Wotao; Jin, Rong; Cai, HanQin (August 2024, IEEE Journal of Selected Topics in Signal Processing)

Attention-based models have achieved many remarkable breakthroughs in numerous applications. However, the quadratic complexity of Attention makes the vanilla Attention-based models hard to apply to long sequence tasks. Various improved Attention structures are proposed to reduce the computation cost by inducing low rankness and approximating the whole sequence by sub-sequences. The most challenging part of those approaches is maintaining the proper balance between information preservation and computation reduction: the longer the sub-sequences used, the better information is preserved, but at the price of introducing more noise and computational costs. In this paper, we propose a smoothed skeleton sketching based Attention structure, coined S³Attention, which significantly improves upon the previous attempts to negotiate this trade-off. S³Attention has two mechanisms to effectively minimize the impact of noise while keeping the complexity linear in the sequence length: a smoothing block to mix information over long sequences and a matrix sketching method that simultaneously selects columns and rows from the input matrix. We verify the effectiveness of S³Attention both theoretically and empirically. Extensive studies over Long Range Arena (LRA) datasets and six time-series forecasting datasets show that S³Attention significantly outperforms both vanilla Attention and other state-of-the-art variants of Attention structures.
Full Text Available
Towards Constituting Mathematical Structures for Learning to Optimize

Liu, Jialin; Chen, Xiaohan; Wang, Zhangyang; Yin, Wotao; Cai, HanQin (July 2023, Proceedings of the 40th International Conference on Machine Learning)

Full Text Available
Learning and Preserving Relationship Privacy in Photo Sharing

https://doi.org/10.1109/BDCAT56447.2022.00029

Liu, Jialin; Li, Lin; Li, Na (December 2022, IEEE/ACM International Conference on Big Data Computing, Applications and Technologies)

In recent years, Online Social Networks (OSN) have become popular content-sharing environments. With the emergence of smartphones with high-quality cameras, people like to share photos of their life moments on OSNs. The photos, however, often contain private information that people do not intend to share with others (e.g., their sensitive relationship). Solely relying on OSN users to manually process photos to protect their relationship can be tedious and error-prone. Therefore, we designed a system to automatically discover sensitive relations in a photo to be shared online and preserve the relations by face blocking techniques. We first used the Decision Tree model to learn sensitive relations from the photos labeled private or public by OSN users. Then we defined a face blocking problem and developed a linear programming model to optimize the tradeoff between preserving relationship privacy and maintaining the photo utility. In this paper, we generated synthetic data and used it to evaluate our system performance in terms of privacy protection and photo utility loss.
Full Text Available
Learning to Optimize: A Primer and A Benchmark

Chen, Tianlong; Chen, Xiaohan; Chen, Wuyang; Heaton, Howard; Liu, Jialin; Wang, Zhangyang; Yin, Wotao (June 2022, Journal of Machine Learning Research)
David Wipf (Ed.)
Learning to optimize (L2O) is an emerging approach that leverages machine learning to develop optimization methods, aiming at reducing the laborious iterations of hand engineering. It automates the design of an optimization method based on its performance on a set of training problems. This data-driven procedure generates methods that can efficiently solve problems similar to those in training. In sharp contrast, the typical and traditional designs of optimization methods are theory-driven, so they obtain performance guarantees over the classes of problems specified by the theory. The difference makes L2O suitable for repeatedly solving a particular optimization problem over a specific distribution of data, while it typically fails on out-of-distribution problems. The practicality of L2O depends on the type of target optimization, the chosen architecture of the method to learn, and the training procedure. This new paradigm has motivated a community of researchers to explore L2O and report their findings. This article is poised to be the first comprehensive survey and benchmark of L2O for continuous optimization. We set up taxonomies, categorize existing works and research directions, present insights, and identify open challenges. We benchmarked many existing L2O approaches on a few representative optimization problems. For reproducible research and fair benchmarking purposes, we released our software implementation and data in the package Open-L2O at https://github.com/VITA-Group/Open-L2O.
Full Text Available
Deep learning for procedural content generation

https://doi.org/10.1007/s00521-020-05383-8

Liu, Jialin; Snodgrass, Sam; Khalifa, Ahmed; Risi, Sebastian; Yannakakis, Georgios N.; Togelius, Julian (January 2021, Neural Computing and Applications)
Full Text Available
General Video Game AI: A Multitrack Framework for Evaluating Agents, Games, and Content Generation Algorithms

https://doi.org/10.1109/TG.2019.2901021

Perez-Liebana, Diego; Liu, Jialin; Khalifa, Ahmed; Gaina, Raluca D.; Togelius, Julian; Lucas, Simon M. (September 2019, IEEE Transactions on Games)

Full Text Available
Plug-and-Play Methods Provably Converge with Properly Trained Denoisers

Ryu, Ernest; Liu, Jialin; Wang, Sicheng; Chen, Xiaohan; Wang, Zhangyang; Yin, Wotao (January 2019, Proceedings of Machine Learning Research)

Full Text Available
Helix: accelerating human-in-the-loop machine learning

https://doi.org/10.14778/3229863.3236234

Xin, Doris; Ma, Litian; Liu, Jialin; Macke, Stephen; Song, Shuchen; Parameswaran, Aditya (August 2018, Proceedings of the VLDB Endowment)

Full Text Available
Theoretical Linear Convergence of Unfolded ISTA and Its Practical Weights and Thresholds

Chen, Xiaohan; Liu, Jialin; Wang, Zhangyang; Yin, Wotao (January 2018, Advances in Neural Information Processing Systems)

Full Text Available
First- and Second-Order Methods for Online Convolutional Dictionary Learning

https://doi.org/10.1137/17M1145689

Liu, Jialin; Garcia-Cardona, Cristina; Wohlberg, Brendt; Yin, Wotao (January 2018, SIAM Journal on Imaging Sciences)

Full Text Available
